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Abstract 

We examine a simple biologically-motivated neural network, the three-layer version 
of the Chialvo-Bak 'minibrain' [Neurosci. 90 (1999) 1137], and present numerical 
results which indicate that a non-equilibrium phase transition between ordered and 
disordered phases occurs subject to the tuning of a control parameter. Scale-free 
behaviour is observed at the critical point. Notably, the transition here is due solely 
to network geometry and not any noise factor. The phase of the network is thus a 
design parameter which can be tuned. The phases are determined by differing levels 
of interference between active paths in the network and the consequent accidental 
destruction of good paths. 
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1 Introduction 



Biological neuronal systems are certainly complex — but the source of this com- 
plexity still remains unclear. Bearing in mind the many examples of com- 
plex behaviour arisin g fr om simple rules [H 0, 0, 0, H, E|, a number of au- 
thors 0, 0, H, H 11, EH EH 14 1 have suggested that some such basic 
mechanics might be responsi ble, and a number of neural models have been 
proposed @, Q EH Q EH El 

Il4j which aim to show that even a very simple 



interpretation of biological neural dynamics can nevertheless produce complex 
patterns of learning behaviour. Perhaps the most appealing is that created by 
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Chialvo and Bak 

El 

Il4j : rather cheekily dubbed the 'minibrain' [11 
table for its combined use of extremal ('winner-takes-all') dynamics 
solely negative feedback to create a fast and highly adaptive learning system. 

In this paper we examine the learning patterns of the simplest version of this 
model and present numerical results which indicate that, subject to tuning of 
a control parameter, a non-equilibrium transition is observed in the learning 
dynamics between ordered and disordered phases, similar to that observed in 
traditional Hopfield neural networks [l7, 3, lj| 2(1 21, 2^, Hif; scale-free be- 
haviour is observed at the critical point. Notably, the control parameter in the 
minibrain network is the network geometry: the network phase is therefore a 
design parameter of the system. Although a complete analytic understanding 
of this phenomenon is still to be developed, the underlying mechanism behind 
this behaviour is identified as being interference between active paths in the 
network. The behaviour at the critical point can also be compared to Koho- 
nen's self-organizing map (SOM) 0, where a small stimulus can also lead to 
scale-free modification of the network structure l2i 



2 The Chialvo-Bak 'minibrain' 



Although a variety of alternative network topologies are possible 0, 14 1, the 
simplest version of the minibrain consists of three separate layers of neurons 
(see Figure 1): a layer of rjip inputs, a 'hidden' intermediary layer of size 7/im, 
and a layer of r/op output neurons. Each input neuron has a (one-way) synap- 
tic connection to every intermediary neuron, and similarly each intermediary 
neuron is connected to every output neuron; we will use the terminology 

r = r(?7ip,r7iM,?7op) (!) 
to denote such a network geometry. 

Each synaptic connection is assigned a strength value, initially determined 
randomly: when an input neuron is stimulated, a signal is sent along the single 
strongest connection leading from it, and the intermediary neuron at the end 
of that connection then fires. In the same manner, the output neuron that 
fires is the one at the end of the strongest connection leading from the firing 
intermediary neuron. These extremal ('winner-takes-all') dynamics, inspired 
both by earlier SOC models p and the SOM [lij . mean that we can identify 
an active level of synaptic connections, the strongest connections leading from 
each neuron, that are collectively responsible for all decisions taken by the 
system. As we will see later, the active level is an extremely important aspect 
of the minibrain network. 

Chialvo and Bak assume that synaptic activity leaves behind a tag of some 
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Output neurons 



Fig. 1. A three-layer minibrain network with rjip = 3 input neurons, t^im = 6 inter- 
mediary neurons and 770 p = 4 output neurons. Each input neuron has a (one-way) 
synaptic connection to every intermediary neuron, and each intermediary neuron has 
a similar connection to every output neuron. Using the shorthand of Equation (1), 
this network would be denoted by r(3, 6,4). 



sort (biologically, perhaps a chemical trace) which identifies that synapse as 
'used'; feedback then takes the form of a global signal sent to the system 
(perhaps the release of some hormone) which reacts only with the tagged 
synapses. Following the lead of Barto, Jordan et al. 0, 0, H|, this global 
signal is extremely simple, indicating no more than whether the action taken 
has failed or succeeded. What next? If success is indicated, no further action 
is taken. There is no strengthening of 'good' synapses. If on the other hand the 
feedback signal is one of failure, the tagged synapses are 'punished' by having 
their strengths reduced by a random amount. 

This selectionist interpretation of biological learning dynamics may be con- 
troversial because, in particular, it clashes with the traditional Hcbbian learn- 
ing Q, where good connections are strengthened — and this has traditionally 
had prime place among theories of biological learning. Chialvo and Bak cite 
two main reasons for their alternative viewpoint 0, 3| : firstly, reinforce- 



ment of connections that produce good responses leads to addiction unless some 
mechanism exists to stop the reinforcement once a correct response has been 
learned j25|, whereas punishment of connections that produce bad responses 
by definition stops as soon as a good response is achieved; secondly, learning 
by mistakes is quicker, as a system in an unknown situation is more likely to 
make mistakes than get things right. That selectionism and negative feedback 
may in fact be the primary biological learning mechanism has also been em- 
phasized by a number of other authors in works both experimental jirjl . I27I ] 
and theoretical [28, 29]. A detailed account of the biological motivation for 
the model can be found in 
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Fig. 2. Mean number, (pi), of applications of negative feedback required to learn an 
input-output map of size n, plotted as a function of the number t}im of intermediary 
neurons. Data is averaged over 2048 realizations. 

The minibrain has already been shown to have interesting learning behaviour 0, 

in particular being able to learn nonlinear functions such as the XOR 
problem and its generalization, the parity problem. Also, and importantly, it is 
a highly adaptive model, able to quickly unlearn patterns of behaviour that are 
no longer successful — a direct result of the negative-only feedback [lil, 14. 3cl . 



3 Control parameter for learning and order-disorder phase transi- 
tion 



Arguably the simplest task one can set such a system is the learning of an 
input-output map (e.g., 1— > 1, 2 — > 2, . . . , n — > n): sequentially stimulating 
each input neuron and, if the output is unsatisfactory, applying negative feed- 
back as described above. Chialvo and Bak showed, although without explicit 
quantification, that learning time for such a map (defined as the number of 
times the network has to run through a complete cycle of inputs before com- 
plete success is achieved) decreases as the number rjiu of intermediary neurons 
increases 0]. However, 'learning time' cannot really be thought of as a phys- 
ical quantity within the minibrain network itself, and hence in this paper we 
will use an alternative (though related) measure of learning efficiency: the total 
number of times, pi, that negative feedback must be applied to the system — or 
equivalently, the number of errors made — before we can run through the com- 
plete sequence of inputs without any mistakes. This measure can be thought 
of as analogous to the measure of neural network learning efficiency used in 
early work on the subject (13, H H 0, H H Q using the traditional 
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Fig. 3. Mean number, (pi), of applications of negative feedback (in r/ipr/op unit) 
required to learn input-output maps of varying sizes, plotted as a function of control 
parameter £ = ^im/^ip^op- Note the data collapse as a result of this normalization, 
and the sharp transition between large and small (asymptotic minimum) values 
that occurs at ( « 1. The data legend uses the notation of Equation (1). Data is 
averaged over 2048 realizations. 

Hopfield neural networks: the number of examples the network must see in 
order to learn a general function. 

Fig. 2 shows that in learning an input-output map of the above type (with 
Vip = Vop — n )) the mean number, (pi), of applications of negative feedback 
required scales qualitatively in a similar manner to Chialvo and Bak's learning 
time: as the number of intermediary neurons rjiu increases, (pi) decreases 
towards an eventual asymptotic minimum whose value appears to depend on 
n. This behaviour is quantified precisely in Fig. 3, where we investigate the 
effects of unequal mappings where 7/ip ^ rjop, e.g., i — ► i mod r/op- both the 
minimum value of (pi) and the decrease in negative feedback that occurs 
with increasing r]i M collapse when (pi) and t^m are normalized with respect 
to rjip7]op, and hence we can define a control parameter for learning time, 



It can be seen that at £ « 1 a sharp transition occurs in (pi) between the 
asymptotic minimum and truly immense values. Numerical results (Fig. 4) 
suggest that this is a genuine phase transition: if once the minibrain has 
learned a given input-output mapping, we randomly reassign a new output 
to one of the inputs, and repeat this process, we find that the probability 
function of negative feedback p 1 required to adapt to the new map varies be- 
tween an ordered phase (£ > 1) where there is a maximum cutoff to learning 
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Fig. 4. Probability function p(pi) of the number of applications of negative feedback 
required to adapt to a new input-output map, based on the control parameter (: 

(a) different distributions of learning time for a network with fixed rjip = ryop = 8 
and varying vim, with power-law behaviour at the critical point ( = t/tm/^ipt/op = lj 

(b) power-law (scale-free) behaviour at the critical point for different system sizes, 
with p(pi) ~ Pi 1 ' 3 - Data taken over 10 6 adaptions. 

times, with p\ distributed exponentially, and a disordered phase (( < 1) where 
learning is almost impossible, with p\ strongly biased towards huge values. The 
probability function of pi becomes scale- free at the critical point ( = 1, scaling 
as a power law with exponent —1.3. 

To understand what this means physically for the network, we have to recall 
what the meaning of negative feedback is. Decreasing the strength of a synap- 
tic connection almost certainly removes it from the active level of synapses; 
another connection replaces it. A large number of negative feedback signals 
therefore corresponds to a large change in the geometry of the active level of 
the network — an 'avalanche' in the network landscape, so to speak. But what 
is it that causes such large avalanches? 



The answer lies in path interference, the overlapping of the active paths lead- 
ing from two or more distinct inputs. In the three-layer minibrain, this is as 
simple as the strongest connections from two different inputs pointing to the 
same intermediary neuron. Broadly speaking, if the paths from two or more 
neurons overlap in this fashion, it is likely that (at least) one of the inputs is 
connected to the wrong output neuron; we can make this explicitly true if we 
require that the input-output map to be learned be one-to-one. The negative 
feedback, however, will affect both paths by removing the intermediary-output 
connection from the active level. Thus, a correct wiring of input to output can 
be destroyed by such interference. 
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A brief statistical analysis reveals the behaviour of this factor. Suppose we have 
an arbitrary minibrain network, r(77rp, rjiu, Vop)- Labeling the input neurons 
by 0, 1, 2, ... , rjip — 1, let us define, for a given input i, to be the inter- 
mediary neuron led to by the strongest synaptic connection from i, and let 

be the probability that at least two input neurons in the set {0, 
share the same value of \i. Thus, the probability of having path interference 
in the network is given by Pr = pL p _i- Assuming that the values of \x are ran- 
domly distributed, it follows that Po = and p\ = and more generally 
(assuming ?7 IM > ^ IP , since obviously P r = 1 if this is not the case), 

pEu = pF + ^(i-pD (3) 



Now, substituting for rjiu in terms of the control parameter £, we have: 



&i=ti + -i ±1 7 (l-rf) (4) 
??ip?7opC 



From which we can calculate, 



Pr. 



E 

i=i 



n 



(5) 



Assuming large, fixed rjip and ?7op, the higher-order terms in Equation (5) 
will vanish for ( > 1, and so Pr will scale approximately according to a 
power law of (, with exponent —1. Conversely, if ( < 1 then the higher-order 
terms are brought back into play and Pr scales approximately exponentially; 
Fig. 5 shows nicely how the critical point ( = I acts as a transition point 
between these two scalings. The phase transition can thus be seen as a transi- 
tion between a high-interference phase where learned patterns are frequently 
destroyed by accident as a result of mistakes elsewhere, and a low-interference 
phase where active connections remain largely stable. 

It is interesting to compare this behaviour with the learning patterns ob- 
served in more traditional neural models, for example the large body of work 
on Hopfield networks [l7, 3, 3 2(1 21, 22|, 23|. Typically the performance of 



an iV-input network in learning a function is studied with respect to the num- 
ber P of examples presented to it (analogous to the parameter pi in this work, 
which is, recall, the number of times the minibrain must be shown where it is 
wrong before the error rate decreases to zero). A transition between ordered 
and disordered phases, similar to that observed here, was observed by Amit 
et al. occurring as a result of the temperature of the system; however, 
the behaviour we see in the minibrain is more reminiscent of the behaviour 
observed by Gardner who showed that, for fixed temperature, the phase 



7 




Fig. 5. Probability Pr of finding path interference in various networks 
r(?jip 3 ijim, ijop)i plotted as a function of control parameter £. Note the transition at 
C ~ 1 between exponential and power-law behaviour. Values numerically calculated 
from Equation (4). 

transition in the Hopfield model occurred subject to the connectivity of neu- 
rons. In the minibrain, the path interference driving the phase transition is 
similarly a result solely of the network geometry — but in addition the mini- 



brain is noise- (temperature-) resistant 14|. The network phase can therefore 



be seen as robust with respect to the environment. 



4 Conclusions 



While only a very simple interpretation of real neural dynamics, we have 
seen that the minibrain displays an interesting variety of behaviour, with a 
non-equilibrium transition between ordered and disordered phases occurring 
solely on the basis of network geometry. Thus the network phase is not due to 
external noise but is instead a design component. The system can be tuned — 
for instance, by evolution — to the most advantageous phase. 

The question of which phase is most advantageous, of course, depends on 
the minibrain's surrounding environment. This immediately motivates further 
research into the question of which environments might favour which of the 
different phases. In addition, considering the biological motivation for the 
model, one might want to investigate whether behaviour resembling any (or 
all) of these phases is to be found in real neural systems of different types. 

The results here also emphasize the importance of negative feedback as pro- 
viding behavioural control in learning systems. The accidental destruction of 
good connections that drives the different phases in the minibrain model re- 
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lies on the fact that, because of the negative-only feedback, the active and 
inactive connections in the network are of similar strength flii 14 , l^l • Not 



ljj . 14 ] , they also place in the 



only are such systems faster and more adaptive 
hands of the designer aspects which would otherwise be in the hands of the 
environment. 



In summary, the minibrain's combination of negative-only feedback and ex- 
tremal dynamics can be seen to cause complex behavioural patterns that de- 
pend much more on internal characteristics of the network than any external 
pressures. This simple neural model has already been shown to have inter- 
esting implications for learning systems and the importance of selectionist 
procedures; the results presented here extend this to the wider context of 
phase transitions on networks and their origins. 
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